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COHERENT QUANTUM TOMOGRAPHY 


JOONAS ILMAVIRTA 

Abstract. We discuss a quantum mechanical indirect measure¬ 
ment method to recover a position dependent Hamilton matrix 
from time evolution of coherent quantum mechanical states through 
an object. A mathematical formulation of this inverse problem 
leads to weighted X-ray transforms where the weight is a matrix. 
We show that such X-ray transforms are injective with very rough 
weights. Consequently, we can solve our quantum mechanical in¬ 
verse problem in several settings, but many physically relevant 
problems we pose also remain open. We discuss the physical back¬ 
ground of the proposed imaging method in detail. We give a rig¬ 
orous mathematical treatment of a neutrino tomography method 
that has been previously described in the physical literature. 


1. Introduction 

We present a new imaging modality which we call coherent quan¬ 
tum tomography. The reason for this name will be discussed in sec¬ 
tion [3j Our goal is to model and solve the following problem: An object 

C M n is to be imaged nondestructively. Properties of the object are 
described by a (hermitean) matrix valued function H on O, a quantum 
mechanical Hamiltonian. Particles are fired through and they expe¬ 
rience time evolution due to the Hamiltonian H and the final state of 
the particle is measured. How much about the function H can we infer 
from this data? 

Of course, much depends on what initial states in the state space are 
possible and what kinds of measurements are made. We will mainly 
focus on two kinds of situations, a general case with ideal data and 
a more realistic one with neutrino oscillations, but the discussion of 
section [2] is valid for any data. These problems will be described in 
more detail below. 

The imaging modality we discuss is a quantum mechanical one and 
relies on coherence between states in a multiple dimensional state space. 
We restrict our attention to finite dimensional state spaces, but many 
practically interesting systems are indeed finite dimensional. The prac¬ 
tical setting resembles that of X-ray tomography (sending rays through 
an object and measuring what happens to them) but the physical 
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phenomena behind the measurement and the mathematical model are 
rather different. Our analysis does, however, lead to a certain type of 
X-ray transform. 

The application we have in mind is imaging the Earth with neutri¬ 
nos. The idea is to produce neutrinos somewhere at the surface (in 
a nuclear reactor or a particle accelerator) and measure neutrinos of 
different flavours coming from this source. Neutrinos come in three 
flavours, electron, muon and tau neutrinos (u e , z/ M , v T ) and typical nu¬ 
clear reactions produce electron neutrinos or electron antineutrinos. 
These neutrino flavours will oscillate — turn into each other while 
propagating — and details of this oscillation depend on the medium 
they traverse. The problem is then to recover properties of the medium 
from neutrino oscillation data. For basics of neutrino physics, we refer 
to the book [T5] and section 

Neutrino tomography of the Earth is not a new idea. Tomography of 
our planet using neutrino absorption has been discussed in [06) [22, 05] 
and using matter effects in neutrino oscillations in [35j ES, 25] • A dis¬ 
cussion and comparison of these two methods can be found in [66]. 
It has also been proposed that geoneutrinos produced in nuclear de¬ 
cays in the Earth could be used for imaging the Earth [8] and that 
neutrino tomography could be used as a means of earthquake predic¬ 
tion [63]. A spectrometric approach to using neutrinos in imaging the 
Earth was recently proposed in m- We aim to bridge the gap be¬ 
tween the physical literature of the idea of neutrino tomography and 
the mathematical literature of inverse problems and imaging by pro¬ 
viding a physically meaningful and mathematically rigorous discussion 
of the neutrino oscillation tomography problem. 

Due to the quantum mechanical nature of the problem, our data is 
phaseless. For some other results concerning phaseless data, we refer 


3.1 


Problems considered in this paper can also be posed on manifolds, re¬ 
placing 0 by a compact Riemannian manifold with boundary and lines 
by geodesics. Euclidean geometry is most relevant for many physical 
applications, so for the sake of simplicity we restrict our attention to 
it. A reader familiar with differential geometry will observe that many 
arguments carry over to Riemannian manifolds. We end up introducing 
some new ray transform problems (see problems 5.1 and 3.1), which 
can also be asked on Riemannian manifolds. We provide a partial so¬ 
lution in the Euclidean setting in theorems 7.2 and 7.3[ See section [7] 
for more details on ray transforms. 


1.1. Outline. We will introduce the quantum mechanical problem in 
detail in section [2j and then we will have the language to state our 
main results in section 2.1 We will overview the physical aspects of 


the proposed indirect measurement in section [3} Section [4] is devoted 
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to converting onr phase free data to a more useful form. In section [5] 
we will prove some of our main results using a pseudolinearization 
argument. This will involve some weighted X-ray transforms where the 


weight is a matrix; these are introduced in section 5.1 In section 5.2 


we will reduce our quantum mechanical problem to a ray transform 
problem. We will also discuss the linearized problem in section 5.3 Our 


quantum mechanical problem involves a time ordered exponential, and 
we investigate the same problem without time ordering in section [6j In 
section [7] we will show that some of our new weighted X-ray transforms 
are injective with very rough weights. 


2. The quantum mechanical problem 

We will use natural units: c — h— 1. Consider a point particle mov¬ 
ing in n-dimensional Euclidean space, with position at a time t given 
by 7 (t) = Xq + tv € M n , where \v\ = 1. The particle is ultrarelativistic 
- its speed is practically the speed of light — and it is assumed not 
to scatter or be absorbed. 

Besides moving in space, the particle has a state in a quantum me¬ 
chanical state space where time evolution is given by the Schrodinger 
equation. Vectors in the state space will be referred to as states. 

Note that the dimension n of the ambient space and the dimension of 
the state space N need not have anything to do with each other. The 
dimension N need not be high to accommodate interesting physical sys¬ 
tems. The case N = 1 is in a sense empty, but two state systems (such 
as spin) provide an example where N — 2 and for neutrino oscillations 
N = 3 is relevant (excluding possible sterile neutrinos). 

Suppose that the Hamiltonian governing the time evolution in the 
quantum mechanical state space C N is a hermitean N xN matrix H{x ) 
depending on the position of the particle in M n . In this case the 
Schrodinger equation for the state T(f) 6 C N is 

idtV(t) = i/( 7 (t))T(t). 

For notational simplicity, let us drop the line 7 and consider H a func¬ 
tion of time. Let Uh^Ti) be the time evolution operator (solution 
operator) for which T(f) = Un(t, fo)d'o is the unique solution to 

itf'(t) = H(i)V(t) 

T(f 0 ) = V IV 

Formally, we may write Uh as a time ordered exponential 
U H (t 2 ,ti) = Tex p £ H(~f(t))dt 

^For a discussion of the time evolution operator as a time ordered exponential, 
see textbooks in quantum mechanics, such as |49j . The time ordered exponential 
has a series representation known as the Dyson series. 
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If [H(t), H(s)] = 0 for all t, s e then this is just the matrix 

exponential. We will consider unordered time evolution, where Texp 
is replaced by exp, in section [ 6 ] 

Consider the time evolution from creation of a particle at t — 0 to 
observation at t = T. Suppose there is a set ^ C C N of possible initial 
states T(0) and a set & C C N of observable final states. We assume 
that we can prepare the particle to any initial state in JA at will. If 
the final state 'b(T) = Uh(T , O)'b(O) is measured via an operator A, we 
can observe (from the statistics of a large number of measurements) the 
norms of the projections of T(T) to eigenspaces of A. If the spectrum 
of A is non degenerate, we can measure |a*T(T )| 2 for all eigenvectors 
(eigenstates) a of A. The set & consists of all nondegenerate eigenvec¬ 
tors of all observables that can be used for measurements. 

Consider a domain hi C R n which we cannot enter but through 
which we can fire particles. Our aim is to obtain information about 
the Hamiltonian H in O from measurements of particles sent through it. 
This is a nondestructive measurement problem or an inverse problem. 

We assumed the particle to be ultrarelativistic (|u| = 1) and also 
scattering and absorption to be negligible. This means essentially that 
the medium only affects the particle in its time evolution in the state 
space C N . As mentioned in section [lj the main example we have in 
mind is neutrinos travelling through the Earth, the objective being to 
recover the inner structure of the planet from neutrino oscillation mea¬ 
surements. Information from such measurements should, of course, be 
combined with similar measurements by other methods (most impor¬ 
tantly seismic imaging), but here we focus on the quantum mechanical 
tomography problem. 

We adopt the following notation: For a line segment 7 : [0, T] —> 0, 
we denote Ufj := C^(T, 0 ), the time evolution operator over the entire 
length of the line. 

The problem is now this: 

Problem 2.1. For every line segment 7 : [0, T] —)• f2 with endpoints 
on dfl we measure \Q*Uf 1 'i l \ for all T e J? and <f> e &. How jnuch 
can we infer about the Hamiltonian H: Q C NxN from this data? 


We will assume throughout that the Hamiltonian only depends on 
the position. It is also possible (and physically relevant, see section 3.2) 
that the Hamiltonian depends on the direction v. If the dependence 
on v is polynomial, the X-ray tomography problems that arise below 
will have to be generalized to tensor tomography problems. Recovering 
a tensor held from its integrals over all lines (or geodesics) can only be 
possible up to potential tensors. This natural gauge condition is the 
only obstruction in many situations; see for example [531, 54j, HOj 0Tj. 
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Definition 2.2. We say that the pair of subsets J 1 C C N gives ideal 
data if for any two unitary matrices U and V the condition \a*Ub\ = 
|a*H5| for all and 6 e / implies U = e lip V for some <p G M. 

We also assume perfect data in the sense that the measurements are 
done over every line through the domain. Partial data problems with 
restricted sets of lines are also physically relevant, but we do not pursue 
them here. 

The first problem is inferring as much as possible about U]j from the 
data. It follows from lemma 14.11 below that in the case of ideal data we 
can recover the matrix W H up to a phase factor. This implies that if we 
add a position dependent scalar multiple of the identity matrix, f(x)I, 
to H(x), the data is not changed. We are therefore completely blind 
to multiples of the identity in H, and the true problem is whether this 
is the only obstruction to recovering H. 


2.1. Results. We will show in theorem 5.2 that ideal data (in the sense 
of definition 2.2 above) is enough to determine the trace free part of 
the Hamiltonian matrix H everywhere, provided that certain weighted 
X-ray transforms are injective. 

Theorem 5.2: Let 12 C M n , n > 2, be a convex 
bounded smooth domain and write M — Cl. Suppose 
the sets J?, & C C N , N > 2, give ideal data. Let 
H: M —>■ C NxN be continuous and pointwise hermitean. 

Assume that we know \a*U]jb\ for all a G b G JX and 
every line 7 : [0,T] —> Cl through 12. If X w is injective 
for any continuous weight W : SM — » SU(N 2 ), then the 
data uniquely determines the trace free part of H. 


Here X w is the X-ray transform with weight W (see section 5.1 for 
a definition) and SM — M x S '” -1 denotes the sphere bundle of M. 

consider the special case where H{x) = 


I 11 theorem 5.4 


we 


will 


H 0 (x) + f(x)G(x), where Hq and G are known matrix functions and / is 
an unknown scalar function; the result is that ideal data determines /, 
provided that G is nowhere a multiple of the identity. 

Let 12 C M n , n > 3, be a strictly convex 


Theorem 5.4 


bounded smooth domain and write M — 12. Suppose the 
sets y, & C C N , N > 2, give ideal data. Fix any a > 0. 
Let H 0l G: M —y C NxN be C 1 ’"-smooth and pointwise 
hermitean. Let H{x) = H 0 (x) + f(x)G(x) for a function 
/ G C 1 ,a (M). Assume that we know \a*U]jb\ for all 
a G J^", 6 G / and every line 7 : [0, T] —y 12 through 12. 
If Hq and G are known and G is nowhere a multiple 
of the identity, then the data uniquely determines the 
function /. 


We will discuss the linearized problem in section 5.3 
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In section [ 6 ] we will introduce an unordered version of our main 


problem 2T We emphasize that the unordered problem 6 T is neither 
a generalization nor a specialization of problem For this prob¬ 

lem we will only be able to solve a linearized version; the solution of 
the unordered problem linearized at a constant background is given in 
theorem 16.131 

6A3\ 


Theorem 


n 


> 2 , be a bounded 


Let LI c R 

domain. Let H G C(Ll,C NxN ) take values in hermitean 
matrices and let H 0 be a fixed hermitean matrix. Sup¬ 
pose that for every line 7 : [0, T] — >■ 0 , with endpoints 
on dLl we know 

e‘™»dexp_ (THo i [ T H( 7 (t))dt 
Jo 

up to an unknown multiple of the identity matrix. Then 
we can recover the trace free part of H(x ) for every 
x G Ll. 

Finally, we will analyze the X-ray transform problem with matrix 
weights that arises in theorem 5.2. We will show in theorem 7.2 
that in Euclidean domains of dimension three or higher a continu¬ 
ous C^-valued function is determined by its weighted integrals over 
all lines when the weight is an invertible C NxN matrix depending on 
position and direction in a sufficiently continuous way. I 11 fact, we will 
prove a local injectivity result for this weighted transform; this is stated 


separately in theorem 7.3 


Theorem 17.21 Let M be the closure of a bounded 
convex domain in M”, n > 3, and let /3 > 0. Let 
W G C^(M, Lip(S' r,_1 , C NxN )) be invertible at every 
point on SM — M x S n_1 . (This regularity assump¬ 
tion is true, in particular, if W G C 1, P(SM,C NxN ).) If 
/ G C(M, C N ) satisfies X w f = 0, then / = 0. 

Combining theorems |5.2 and 7.2 


we obtain the following theorem 


which is our main answer to problem 2.1 


Theorem 2.3. Let LI be a bounded convex domain in M n ; n > 3, and let 
(3 > 0. Suppose H, H G C 1,/3 (f2, C NxN ). Suppose the sets C N , 

N > 2, give ideal data. The two Hamiltonians H and H give the same 
data in the sense of problem 


2.1 


if and only if H = H + (pi for some 


scalar function <p: LI —$■ 


3. Physical discussion 

More detailed physical remarks and background in addition to sec¬ 
tion [ 2 ] are in order, and we give them in this section. A reader who is 
interested in solving the mathematical problem posed above instead of 
justifying it physically may wish to skip this section. 
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In our model a quantum mechanical particle has well defined posi¬ 
tion and momentum at all times. This situation is in fact forbidden 
by the uncertainty principle but it is an excellent approximation. In a 
more careful model particles would have to be modeled by wave pack¬ 
ets which are nontrivially distributed in position and momentum. We 
have (implicitly) assumed that uncertainty in momentum is very small 
in comparison to the mean value of momentum. Therefore momentum 
of a particle as a single vector is a relatively well defined concept, and 
if momentum is large, even very small relative uncertainty allows for 
substantial absolute uncertainty. The necessary amount of uncertainty 
in position may be so small that a particle can essentially be consid¬ 
ered point-like. Well defined position and momentum (which could 
also be informally described as a plane wave supported at a point) 
are a fairly good approximation if momentum is large and the back¬ 
ground medium does not have features on shorter length scales than 
position uncertainty. When the Earth is imaged by neutrinos, these as¬ 
sumptions are easily met. (We will also discuss momentum dependent 


Hamiltonians in section 3.2 


Our mathematical model is only relevant in physical settings where 
the particles are ultrarelativistic and experience very little scattering or 
absorption. Such particles can be reasonably approximated by point¬ 
like particles with a quantum mechanical state as modeled in section [2] 
This is a common leading order approximation in particle physics. 

The original motivation behind this article — to which the model 
is not restricted — is to image the Earth by sending rays through it. 
Most particles experience so much absorption or scattering that prac¬ 
tically no signal makes it through the planet. For neutrinos, however, 
absorption and scattering are very weak and quite difficult to measure. 
Intensity of a neutrino beam is not very sensitive to medium properties, 
so X-ray type imaging is difficult. On the other hand, a quantum me¬ 
chanical phenomenon called neutrino oscillation is sensitive to electron 
density in the medium. This calls for quantum mechanical measure¬ 
ments of the type described in section [2} The fact that quantum me¬ 
chanical evolution is sensitive to something but intensity is not shows 
that the underlying physical phenomenon is quantum mechanical inter¬ 
ference. Such interference is only possible for a coherent superposition 
of quantum states, whence the term “coherent quantum tomography”. 
Neutrino oscillation will be discussed in much more detail in section [Til 
below. 

The observation that we could — in principle at least — see more 
things with neutrinos than other particles due to the weakness of their 
interactions is not a new one. The Universe became transparent to 
photons at the age of about 380000 years and optical observations do 
not allow to see a younger Universe directly. For neutrinos the corre¬ 
sponding age of the Universe is about one second, whence neutrinos 
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“see” significantly older structures than photons do. The problem is 
that neutrinos are much, much more difficult to measure than photons; 
more information is available but it is in a form difficult to measure. 
For more about neutrino astrophysics, see [15, 26, 61J. This is the case 
for imaging the Earth with neutrinos, too: current technology is in¬ 
sufficient for very precise measurements but there is no fundamental 
physical obstruction to seeing new things with neutrinos. 

The usual computerized X-ray tomography is based on measuring 
attenuation of photon intensity. Photons can be replaced with neutri¬ 
nos as mentioned in the introduction, and computerized tomography 
with muons has been proposed in |58] - We emphasize that neutrino 
absorption tomography (as discussed in [3SI [22,13 [663) is significantly 
different from neutrino oscillation tomography discussed in this arti¬ 
cle. For physical articles on neutrino oscillation tomography, we refer 
to [35, [2BJ 123 Neutrino tomography could complement existing 
geophysical imaging methods as discussed in the cited articles. 

Of other existing imaging methods based on coherence we mention 
optical coherence tomography EU and quantum optical coherence to¬ 
mography [60]. There are also other kinds of inherently quantum me¬ 
chanical imaging modalities which often go by the name quantum to¬ 
mography (see eg. [521 [TO , 20]). 

Our imaging method is also related — from a physical point of view 
- to inverse scattering problems for the Schrodinger equation (see 
eg. mw- We study quantum mechanical particles that obey the 
Schrodinger equation, but scattering is not a very significant phenome¬ 
non for the present imaging problem. Moreover, in our problem quan¬ 
tum mechanical time evolution takes place in a finite dimensional state 
space instead of a Hilbert space of functions. 


3.1. Neutrino oscillation. For an introduction to neutrino physics 
and in neutrino oscillation, we refer to [13 El 123 123 E3 E 123 12] • 

We restrict our attention mostly to a quantum mechanical description 
of neutrino oscillation; quantum held theoretical descriptions can be 
found, for example, in [T51 llj. 

There are three kinds of neutrinos (z/ e , zz M , v T ) and the state space 
(excluding motion) of a neutrino is thus a three dimensional complex 
space. We think of this space as C 3 and identify the aforementioned 
neutrino types with the standard basis vectors ei,e 2 ,e 3 . This is the 
so-called flavour basis, and the corresponding basis states (u e , z/ M , u T ) 
are known as flavour neutrinos. A neutrino state is a superposition of 
different flavour neutrinos. 

Neutrinos are typically produced as flavour neutrinos, so in our quan¬ 
tum mechanical model J? C {ei,e2,e 3 }. Neutrinos are typically also 
observed as flavour neutrinos; if all flavours can be measured, we have 
& = (ei, e 2 , £3}. Flavour basis is the natural choice if we only consider 
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creation and annihilation in weak charged current reactions (mediated 
by W ± bosons); if weak neutral current processes are included, the 
picture is more complicated. 

It is sometimes convenient to study neutrino physics in a two-flavour 
model and it is possible that there are more than three neutrino flavours 
(the additional ones are known as sterile neutrinos). If there are sterile 
neutrinos, they cannot be produced or observed directly, so the basis 
vectors e±, e $,... will not appear in J? and &. 

Let us then discuss the Hamiltonian related to neutrinos. There is 
a hermitean 3x3 matrix M describing neutrino masses. There is a 
matrix U diagonalizing M, so that 

/mi 0 0 \ 

U* MU = 0 m 2 0 , 

\ o 0 m 3 J 

where mi, m 2 , m 3 are the neutrino masses. The eigenstates of the ma¬ 
trix M are known as mass states. It is of crucial importance in neutrino 
physics that the mass states are not the same as the flavour states. The 
flavour neutrinos do not have well-defined masses, but suitable linear 
combinations of them (the mass states) do. The relation of flavour and 
mass states is expressed by the lepton mixing matrix, the Pontecorvo- 
Maki-Nakagawa-Sakata matrix U. 

Suppose the neutrino is created at a momentum^ p > 0 which is 
much larger than any of the masses. If the neutrino had mass m E M, 
then its energy would be E = \jp 2 + rri 2 ~ P + Thus in a good 
approximation we may write the Hamiltonian matrix (recall that the 
Hamiltonian is the energy operator in quantum mechanics) as H = pi + 
^M 2 , where M is the mass matrix introduced above. Since multiples 
of the identity are irrelevant for the time evolution (they only introduce 
a total phase), we can consider the Hamiltonian 

(rn\ 0 0 \ 

(1) h 0 =-U\ 0 ml 0 U*. 

2p \ 0 0 ml) 

This simple Hamiltonian gives rise to a remarkable physical phenome¬ 
non: neutrino oscillation. If a neutrino is produced in a flavour state, 
the probability that it is observed in another flavour state (known as 
transition probability) oscillates in time. It is evident from equation Q 
that the key parameters for neutrino oscillations are the PMNS ma¬ 
trix U and the differences of the squared neutrino masses. The momen¬ 
tum p in ([Tj) can be replaced with the energy E, as these are almost 
the same for ultrarelativistic particles. 

2 One can also assume equal energies instead of equal momenta for the mass 
states. These assumptions are different, but lead to the same results. For a discus¬ 
sion of the assumptions on the initial state, see Enin]- 
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This oscillation is produced by quantum mechanical interference of 
different neutrino mass states. Interference is only possible if the differ¬ 
ent mass states are coherent and in the same place. Different masses 
give rise to slightly different speeds, and different mass states drift 
apart slowly, whence coherence is lost and neutrino oscillation comes 
to an end. Transition probabilities stop oscillating, but the asymptotic 
probabilities are not trivial (not only zero or one). For coherence and 
decoherence in neutrino oscillations, we refer to [20, Section 4]. (We 
wish to point out that lost coherence can be recovered at detection; 
see [D Section 5.4].) 

Neutrino oscillation is a kinematic effect; it requires no interactions. 
The neutrino oscillation described hitherto is the kind that takes place 
in vacuum. Interactions with the medium change the Hamiltonian 
of Q and also the oscillation. It is via these changes that we hope to 
determine properties of the medium using neutrino oscillation. 

If the neutrinos propagate in matter with electron number den¬ 
sity N e , we have to add the potential term 


A o o\ 

(2) 2V2 EG F N e 0 0 0 

\0 0 0 / 

(with a minus sign for antineutrinos) to the Hamiltonian of ([!]), ne¬ 
glecting again multiples of the identity matrix. Here E is the neutrino 
energy and Gp is the Fermi constant. This particular form of the po¬ 
tential is due to only electron neutrinos interacting with electrons in 
the medium via weak charged current; the neutral current potentials 
of electrons and protons cancel each other and the one of neutrons is 
a multiple of identity. For details of neutrino oscillation in matter, we 
refer to [EE Section 9.2], 

Because of the form of the potential in (J 2 J) , neutrino oscillations 
provide a means of measuring electron density as a function of position. 
Replacing momentum with energy, we arrive at the total Hamiltonian 


. fmj 0 0 \ /1 0 0\ 

H(x) = — U 0 ml 0 ]U* + Ef(x) 0 0 0, 

2L \ 0 0 ml) \0 0 0/ 


where / is a scaled electron density. The problem is to recover the 
function / from the data described in section [2j possibly with mea¬ 
surements for several energies E. We emphasize that ideal data is not 
available, so theorem |5.4| is not applicable. 


3.2. Tensor Hamiltonians. So far we have only considered Hamilto¬ 
nians H{x) as functions of x G M. It may happen, however, that the 
Hamiltonian also depends on direction, so that H(x,v) is a function 
of (x, v ) G SM. If there is no a priori knowledge on how H depends 
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on v, problem |2.1 is hopeless, as the time evolution operators along 
intersecting lines have nothing to do with each other. 

A natural assumption is that H(x, v ) is a polynomial in v. From the 
point of view of integral geometry, this corresponds to replacing scalar 
fields by (sums of) tensor fields, the order of the polynomial being the 
order of the tensor field. A particularly simple case to analyze is a 
first order polynomial (corresponding to a sum of a function and a 
one-form). For more details on tensor tomography, we refer to [30) 53]. 

The polynomial assumption is also natural physically, and we illus¬ 
trate this with examples related to particles in an electromagnetic field. 
The Dirac equation in M 3 for a fermion of charge q and mass m can be 
written as id t A = H\ 1/ with the Hamiltonian^ 


(3) 


H(x,p) = q<p(x)I + my 0 + (p — qA(x )) • a. 


The state T G C 4 is the Dirac spinor. For a vector V € M 3 , V ■ a = 
Via 1 + V 2 oi 2 + V 3 a 3 , the 4x4 matrices a 1 , a 2 , a 3 are related to the 
Dirac gamma matrices by a* = 7 °y\ A is the magnetic vector potential 
and ip is the electric scalar potential. The Hamiltonian depends on the 
momentum p, which can be turned into dependence on direction and 
energy using the identity p = Ev. The nonrelativistic counterpart of 
the Dirac equation is the Pauli equation with the Hamiltonian 

(4) H(x,p) = (^(P ~ QM X )) 2 + QP 0)^) 1 + ' o’ 


acting on a spinor f 6 C 2 , where B = V x A is the magnetic field, 
B ■ a = Bic*i + B 2 (J 2 + B 3 cr 3 and the as are the Pauli matrices. 

There are several possible choices for the matrices appearing in (j3j) 
and Q. For completeness, we give one here: 


(?i = 


a 2 = 


03 = 


,,0 


0 1 
1 0 


0 


0 


1 0 
0 -1 ' ’ 


7 “ = a 3 0 I 2 and 
0 1 


7* — 


-1 0 


for i — 1, 2, 3, 


where I 2 is the 2x2 identity matrix. The mathematically oriented 
reader can neglect positioning of indices (upper or lower); we used 


3 This form can be obtained from the standard Dirac equation by describing the 

coupling to the photon field by the gauge covariant four-gradient + iqA^ and 
then solving for the time derivative. 
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some upper indices with the Dirac equation and Dirac matrices to be 
consistent with notation in particle physics. 

The natural problem related to these equations is to recover the elec¬ 
tromagnetic potentials p and A from the data described in section [2] 
The weighted ray transform described in section 5.1 has to be gen¬ 
eralized further: we have to consider matrix valued tensor fields, or 


functions SM —» 


’/VxiV 


that are polynomial in the fiber variable v. 


As with all tensor tomography problems, we expect that there is some 
gauge freedom. But this is to be expected on physical grounds as well, 
since one can only ever measure a (magnetic or other) potential up to 
gauge. 

We state this tensor tomography problem separately: 


Problem 3.1. What happens in problem 5.1 if the unknown functions 
are assumed to be matrix valued tensor fields on SM ? What if we 
assume the weight to satisfy XW = iWQ for a matrix valued tensor 


field Q (cf. remark 5.3)? 


4. Gauge conditions for measurements 

Given J^, & C C N , how much about U G U(N ) can be inferred from 
the knowledge of \a*Ub\ for all a G & and b G J"? The best we can 
hope for is knowing U up to a factor e lif , p G M, which means ideal 
data. 

Lemma 4.1. If JA and & are both the unit sphere in they give 
ideal data. 

Proof. We need to show that if for two matrices [/, V G U(N) we have 
\a*Ub\ = |a*V7>| for all a, b G C N with |a| = |6| = 1, then U = XV 
for some A G C, |A| = 1. By a scaling argument we can drop the 
assumption |a| = \b\ = 1. 

It follows from the condition that for any a G C N a vector is orthog¬ 
onal to Ua if and only if it is orthogonal to Va. Therefore there is 
A a G C so that Va = X a Ua. We wish to show that all the constants A a , 
a G C N \ {0}, are in fact equal. 

If a, b G C N are linearly dependent, then by linearity A a = A&. If 
a, b G are linearly independent, then 


X a Ua + XfoUb — V a-\-Vb — IA(o T 6) — A a +i,U (o T b) — X a j r \ ) U a T A a _)_^I/5, 
from which we get 

(Aa+6 X a )U a (Afo X a+ b)Ub. 

Since Ua and Ub are linearly independent, we have X a = A a+ b = X b- □ 

Problem 4.2. How large do the sets & C have to be to give 
ideal data? The assumption of the previous lemma looks like an overkill. 
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The following lemma shows that if we assume the Hamiltonian to 
be trace free (the trace is invisible from the data anyway), then in 
the case of ideal data we can recover the time evolution operator Ufj 
for every line 7 without any gauge ambiguity. After this reduction we 
may restrict ourselves to trace free Hamiltonians, special unitary time 
evolution and most importantly an inverse problem with no obvious 
gauge invariance. 


Lemma 4.3. Let H C W 1 , n >2, be a convex, bounded domain and 
let M denote the space of continuous hermitean matrix valued functions 
H —y C NxN . Fix the sets JA, FF C C N so that they give ideal data. The 
following are equivalent: 

(1) From the knowledge of \a*U]jb\ for all states a G &, 6 6 / and 
every line 7 : [0, T] A 11 through one can recover the func¬ 
tion H e M up to a function f{x)I for a continuous function 

/: Cl -aR. 

(2) From the knowledge ofUfj for every line 7 : [0, T] —» H through H 
one can recover the trace free function Ft e M. 


Proof. 1 => 2 : If we know U]j = U]j(T, 0) for every line 7 : [0, T } -A Cl, 
then by assumption we can recover Ft up to multiples of the identity. 
But since we know that FI is trace free, there is no ambiguity. 

2 =>■ 1 : We write H(x) = A(x) + f{x)I, where tr(A(a;)) = 0 for 
every x E Cl. Now for any line 7 : [ 0 , T] -A H, 

r T 


u h = ex P / /(7(f))d* 


ui 


Since we have ideal data, we know Ujj up to phase (by lemma 4.1), 
and since A is traceless, the matrix Uf is special unitary. Therefore we 
know the matrix Uf up to multiplication by an element of the group 
Gn C U(l) of Nth roots of unity. 

We know that for very short lines 7 the matrix U\ is very close 
to the identity, so by finiteness of Gn we can recover the matrix Uf 
itself (without any gauge ambiguity) for sufficiently short lines. By 
convexity of the domain every line with endpoints on the boundary 
can be continuously deformed into an arbitrarily short one; it then 
follows from continuity of Ft and finiteness of Gat that we can in fact 
recover U\ for all 7 . By assumption, this information is enough to 
determine A, the trace free part of Ft. □ 


The argument in the proof of the lemma above should also work with 
piecewise continuity, but we refrain from pursuing optimal regularity. 

It is a simple exercise to show that if the sets J", FF C CA give ideal 
data and U, V G U(N), then also the pair (U J?, V<&) gives ideal data. 
From this it follows that if the tomography problem can be solved on 
a domain uj C M n and the trace free part of the Hamiltonian is known 
in O \ w for some larger set C M n , then the problem can also be 
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solved in f2. If the particle travels on a concatenation of several curves, 
the total time evolution operator is simply the composition of time 
evolution operators over each segment. Without ideal data there is 
more gauge freedom than a simple phase, and it is less obvious to find 
such layer stripping results. 


Lemma 4.4. Suppose N = 2 and let e\ and e 2 be the basis vectors 
0/C 2 . If y = {ei} and & = {ei,e 2 } (or vice versa), then the following 
are equivalent for two matrices U,V G U (2) : 

(1) \a*Ub\ = \a*Vb\ for all a G & and b G JC 

(2) There are angles a, ( 3,9 G M. so that 


V = e ie 





Proof. 2 =>■ 1: Follows from a simple calculation. 

1 ==>■ 2: Suppose \a*Uef\ = |a*Vei| for both a G {ei,e 2 }. Since the 
columns of U and those of V are orthonormal and the dimension is 
two, we have also \a*Uef\ = |a*Ce 2 | for both a G {ei,e 2 }. Let 


U = 




and 


V 


f na A b 
yfic vd 


for complex numbers k, A, / 1 , v of unit norm. 

If c = 0, then b = 0 and |a| = \d\ = 1 and the claim is obvious. 
Similarly, if a = 0 (and thus d — 0), the claim follows easily. 

Suppose then that a, b,c,d 7 ^ 0. From the orthogonality relations 


ac + bd = 0 
Kjxac + A ubd = 0 


we obtain A/i = ku. We want to choose the angles a, (3,9 so that 

K _ e i(9+a+/3) 


A = e i( - e+a ~ p \ 


H = e i( - e ~ a+p) and 
v = e i{e - a -P\ 


It is easy to verify that this is indeed possible due to the constraint 
Ap = kv. □ 


Remark 4.5. It is natural to ask if lemma 4.4 holds for N > 2 assuming 
J? = & = {ei,..., ejv}- The analogue of implication 2 1 is easy 

to verify, but the converse is false. Condition 2 of the lemma describes 
a symmetry group (that fixes norms of matrix elements) of dimension 


2IV — 1. 
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We may linearize (w.r.t. the phase of each matrix element) to find the 
dimension of the tangent space of the full symmetry group and thus 
the dimension of that group. To simplify matters, we assume each 
matrix element to be nonzero. Experiments show that the group of 
symmetries preserving the norms of elements of a generic 3x3 unitary 
matrix has dimension 6 , not 5. We have been unable to identify the 


group itself in a spirit similar to condition 2 of lemma 4.4 


The following problem needs to be solved before one can study our 
main problem for N > 3 with = & = {ei,..., ejy} the standard ba¬ 
sis of C iY . This problem with N = 3 is relevant for neutrino oscillations, 
see section l3Tl 

Problem 4.6. Given a unitary matrix U e U(N), N > 3, describe the 
set 

{VeU{N)-\V ij \ = \U ij \ for all ij}. 


Is there a description as simple as that of lemma 4-4 f or N — 2? 


5. Pseudolinearization 

Here we introduce a so-called pseudolinearization that reduces our 
nonlinear problem to a linear one with unknown parameters. It is 
enough to know some general characteristics of these parameters (cer¬ 
tain weight functions) to show injectivity of the linear problem. It also 
gives rise to an iterative reconstruction scheme which we shall discuss 


after theorem 5.2 A similar idea has been previously used for rigidity 
problems isnusz]. 

We will phrase our results in the setting of continuous unknown 
functions. If we assume more regularity, we can assume more regularity 
in the weights of the corresponding ray transform problems. 

5.1. Matrix-weighted X-ray transform. Let Q C M n , n > 2, be 

a bounded smooth domain, and denote M = Cl. Denote by SM = 
S n ~ l x M the unit sphere bundle of M. Fix an integer N > 2 and 
a continuous function W: SM —>■ SU(N). For a continuous function 
/: M —> C N , define the IT-weighted X-ray transform l w f of / by 
letting 

%wf{ 7)=/ W'( 7 (*), 7 (t))f(t)dt 

Jo 

for every unit speed line 7 : [0, T] — y M with endpoints on dM. 

As it will shortly turn out, pseudolinearization leads to this integral 
transform. Weighted and especially attenuated X-ray transforms have 
been studied before (see eg. [121123 El E01 SSI ® Ell E21 153]). but our 
results about X-ray transforms with matrix weights are new to the best 
of our knowledge. The matrix weight is, in a sense, an exponential, 


and could therefore be regarded as an attenuation (see remark 5.3). 
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The attenuated X-ray transform with matrix attenuation is related to 
recovering a matrix valued connection from is parallel transport; for 
results about connection problems, see pm CEO EH [36]. 

The (scalar) weights that arise in the proof of theorem 5.4 are not 
attenuations. We have results for more general weights in section [7] and 
we do not wish to pursue optimal regularity at the expense of clarity, 
so we will not study the attenuated case further here. 


Problem 5.1. Is the transformX w injective on continuous (C N -valued) 
functions whenever n,N> 1? What if we assume the weight W and 
the function f to be smooth? What if M is replaced with a compact 
Riemannian manifold with boundary? 


5.2. Ideal data. We are now ready to pseudolinearize our problem. 
We assume ideal data in the sense of definition 12.21 


Theorem 5.2. Let 12 C M n , n> 2, be a convex bounded smooth domain 
and write M — 12. Suppose the sets J? C C N , N > 2, give ideal 
data. Let H: M —> C NxN be continuous and pointwise hermitean. 
Assume that we know \a*U]jb\ for all a G &, b e and every line 
7 : [0, T] —> 12 through 12. If Z w is injective for any continuous weight 
W : SM —> SU(N 2 ), then the data uniquely determines the trace free 
part of H. 


Proof. By lemma 4.3 we can assume that H is trace free and we 


know Uf for every line 7. We have replaced H with its trace free part, 
so we aim to recover the function H without any gauge restrictions. 

Suppose that for continuous, hermitean, trace free matrix functions 
H,H\ SM —> C NxN we have Ufj = [7T for every line 7 through M. 


We wish to show that H = H. 

Fix any line 7 : [0, T] —> M with endpoints on dM. Define V: [0, T] —> 
SU(N) by 


V(t) = Uf 1 (T,t)Ul(t,0)- 


Since tr (H) = tr (H) = 0, all time evolution operators are in fact special 
unitary. 

Since 


fu] I (T,t)=iU] ] {T,t)H( 1 (t)) and 

fupt.o) = -iHMt))upt,o), 


( 5 ) 
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we have 


( 6 ) 


0 = Ug(T, 0) — U]j(T, 0) 
= v(T) - y(o) 
f T d 

= l V Vm 


= i 


Ul(T , t)(H(^(t)) - HfrmuKt, 0)dt. 


For every (x, v ) G SM, let 7 X>1) : [0, T X)t; ] —)■ M be the unique maximal 
line passing through point x in direction v. Let t x>v G [0, T x>v ] be such 
that r Yx,v(t x ,v) = x. Dehne A, B : SM —y SU(N) by 

A(x,v) — Uff' v (T XjV1 t XjV ) and 
7 B(x,v) = U%' v (t XtV , 0). 

These functions are continuous. Dehne f:M — > C NxN by f(x) = 
H(x) — H(x) and let W: SM —y GL(C NxN ) be dehned by 

W(x,v)(Z) = A(x,v)ZB(x,v) 

for every matrix Z G C NxN . If we endow C NxN with the standard 
inner product, then the linear transformation Z i —> A(x,v)ZB(x,v) is 
a special unitary one since the matrices A(x, v ) and B(x, v ) are special 
unitary. (We may think of C NxN as the tensor product C N ®C N with A 
acting on the first component and B acting on the second one. The 
tensor product of special unitary matrices is again special unitary.) 

Now we have a continuous function /: M —> C N “ and a continuous 
weight W : SM —» SU(N 2 ). Equation (jhj reads now Xwf — 0. Since 
such transforms were assumed to be injective, we have / = 0 and thus 
H — H. □ 

Pseudolinearization can also be used for reconstruction. One first 
starts with an initial guess Hq. The data for Hq and the true Hamilton¬ 
ian H 0 o determines an X-ray transform of H 0 — H^ with weights given 
by H 0 and cf. One can then approximate // 0 — 11^ by inverting 
this transform with weights given by H 0 and H 0 . (This means treating 
the pseudolinearization as if it were the linearization; cf. section 5.3 ) 
This gives rise to an improved guess Hi for H^, and one may proceed 
iteratively to find successive approximations // 2 . //■;. H 4 ,... using the 
same method. Then hopefully Hk -G IL^ as k —> oo in a suitable 
sense. This idea been used to solve inverse problems numerically via 
pseudolinearization in [6l IS]. 


Remark 5.3. The weight that arose in the proof of theorem 5.2 is of a 


special kind. Identifying C NxN with C N ®C N , the weight is W(x,v) = 
A(x,v) <g) B(x,v)*. Let us dehne the differential operator X — v ■ V x 
acting on functions on SM ; this operator is known as the geodesic 
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vector field and it generates the geodesic flow. Comparing equations (|7j) 
and ([5]), we observe that XA = iAH and XB = —iBH , whence XW = 
iWQ, where Q — H ® I -\-1 ® H (which is hcrmitean). The weight can 
therefore be regarded as an attenuation. We will prove an injectivity 
result for weighted X-ray transforms even for very rough weights in 
section [7| and we need not assume that the weight is an attenuation. 
A more detailed discussion of X-ray transforms is given in section [7J 


The following result is based on weighted X-ray transforms with 
scalar weights. We present the theorem in the amount of generality al¬ 


lowed for by our result on local weighted X-ray transforms, theorem 7.3 


Results for such transforms are known also on manifolds [331E2J , so 
the result can be easily generalized. 


Theorem 5.4. Let id C M. n , n > 3, be a strictly convex bounded smooth 
domain and write M = fh Suppose the sets C N , N > 2, give 

ideal data. Fix any a > 0. Let H 0 ,G: M —> C NxN be C 1,a -smooth 
and pointwise hermitean. Let H(x) = H 0 (x) + f(x)G(x ) for a function 
f G C 1,a (M). Assume that we know \a*W H b\ for all a G &, b G J" and 
every line 7 : [0, T] —* Q through Ft. If H 0 and G are known and G is 
nowhere a multiple of the identity, then the data uniquely determines 
the function f. 


Proof. By lemma T3 we can assume Ho and G to be trace free, 
trace free part of G does not vanish anywhere. 

Suppose two functions /, / G C 1 ,a (M ) give the same data. As in 
we get 


The 

<§. 


( 8 ) [ T (f( 7«) - fhmui(T. t)GMt))Ul(t, 0)d t = 0 

Jo 

for all lines 7 : [0, T] —y M with endpoints on dM. If the line goes 
through x e M and has direction v e S '” -1 at time t, let us denote 


Wij(x,v) = [U]j(T,t)G( 'y(t))Ul(t,0)\ ij 

for all indices 1 < i,j < N. In this way we can define Wij(x,v) for all 
(x, v) G SM. 

By ([ 8 ]) know that the scalar function g — f—f has vanishing weighted 
X-ray transform for each of the weights Wij. Since the matrix whose 
coefficients are is nonzero, the weights cannot vanish simultane¬ 
ously. The weights are also C 1,a on SM, so every point (x, v ) G SM 
has a neighborhood where one of the weights is bounded away from 
zero. I 11 particular, if x G DM and v is tangent to dM, there is such a 
neighborhood. Furthermore, since g is real valued, its weighted X-ray 
transform vanishes also with the weight Re (Any,) for any A G C; we 
can thus always locally find a strictly positive real weight for the X-ray 
transform. 
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Now, using theorem L3 with N = 1, we observe that / — / = 0 in 
some neighborhood of x. We may iterate this argument (a similar layer 
stripping method was presented in [62]) to show that / — / = 0 in all 
of M. The method presented in [62.J requires a foliation condition that 
is satisfied by a strictly convex Euclidean domain. (Instead of iteration, 
one can also argue by contradiction as in the proof of theorem |7.2[ ) □ 

5.3. Linearization. Since we have been able to give some answers to 
the full nonlinear problem |2.1[ we will only briefly discuss its lineariza¬ 
tion. 

The linearized problem will lead to a similar weighted X-ray trans¬ 
form problem that arose in the proof of theorem 5.2 The difference is 


that the weight will be known (dependent on the reference Hamiltonian 
at which we linearize) and therefore easier to invert. For completeness, 
we prove below that this is indeed the correct linearization. Lineariza¬ 
tion may be a relevant approach when the Hamiltonian is known to be 
close to a known Hamiltonian or when one wishes to track (slow) time 
dependence of the Hamiltonian. 


Proposition 5.5. Let H s be a family of smooth Hamiltonians in a 
domain hi C M n parameterized smoothly by s G (—£,£) for some £ > 0. 
Let 7 : [0,T] —* 0 be a line. Then 

ful .1,-0 = i [ U] Ia (T,t)f s HMm,.oUi 0 (t.O)dt. 

Proof. We can see from equation that for any s we have 

(9) Ul -XP Ha =i [ T l ^ 0 (T,i)(ff 0 (7(t)) - H s ( 7 (f)))f/^(f, 0 )dt. 

Jo 

Let us denote H'(x) = f^H s {x) | s=0 . Since H 0 ( 7 (f)) — ih s ( 7 (f)) = 
sHf'yft)) + o(s) and U]j a (t, 0) = Ufj Q (t, 0) + o(l) as s —> 0 (follows 
from ([9])), we have 

Uh s — Ufj o — is [ T U] Io (T,t)H\ 1 (t))Uj Io (hO)dt + o(s) 

Jo 

as claimed. □ 


6. Unordered time evolution 

In section [2] we introduced the time evolution operator and mentioned 
that it could be expressed as a time ordered exponential. We will now 
investigate what happens if we replace the time ordered exponential 
with the usual matrix exponential. This problem is no longer quantum 
mechanically relevant, but it makes an interesting mathematical inverse 
problem all the same. 

If the Hamiltonian H satisfies [H(x), H(y)\ = 0 for all x,y E Q, then 
the time ordered and the usual unordered exponentials of all integrals 
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of H coincide. We do not assume this commutator property in this 
section, and therefore the time ordered and regular matrix exponentials 
are different. 

We have a set of initial states J? C C N and a set of final states 
& C and for all lines 7 : (0, T) —>■ with endpoints on dfl we 
know the inner products 


<f>* exp 


Hh(t))dt T 


for all <I> £ .AT and T e The only difference to problem 2T 
that Texp is replaced with exp. 

In the case of ideal data this means that the matrix 

rT 


is 


( 10 ) 


exp ( —i / //( 7 (t))dt 


can be recovered up to a phase. For ideal data, the unordered problem 
is now this: 


Problem 6.1. Let fl C M n be a domain. 

7: [0, T] —> with endpoints on dfl we blow the matrix (10) 

much can we infer about the Hamiltonian H? 


If for all line segments 
how 


This problem quickly leads to this auxiliary problem: 


Problem 6.2. Let A, B be two hermitean N x N matrices. When do 
we have e lA = e lB 9 Can we classify the pairs of matrices that have the 
same exponential? 


We give a solution to this problem in proposition |6.3| below, but, 
unfortunately, this result is difficult to work with. Therefore instead 
of the full problem we treat the linearization. Dropping time ordering 
from the exponential makes a huge difference in the linearization, as we 
shall see below. The answer to problem 6D, linearized at any constant 
Hamiltonian, is given in theorem 6.13 below. 


6.1. When do two matrices have the same exponential? 

some integer N > 1. For an N x N matrix A, let us denote its 
eigenspace corresponding to the eigenvalue A G C by 

E a = ker(H - XI). 

Let us also define the “periodic eigenspaces” 

D = © 7+2* 

fcez 

With these spaces we can formulate a simple answer to problem |6.2 

Proposition 6.3. Let N > 1 be an integer and A and B two her¬ 
mitean N x N matrices. Then the following are equivalent: 

(1) e lA = e lB , 
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(2) F A = Ff for all XeR. 

Proof. 2 =>■ 1: Let 0 < Ai < A 2 < • • • < X n < 2t be the numbers 
for which dim(Fj^) > 0 and denote the corresponding spaces by Sk = 
F A k = F B . Now C n is an orthogonal direct sum of the spaces S±,... ,S n 
and the matrices A and B are block diagonal in this decomposition 
(that is, A{Sk), B{Sk) C Sk for all k). Let Ak, 1 < k < n, denote the 
corresponding block of A, and similarly Bk, Uk and 14 for B, U = e lA 
and V = e lB . 

Considering kill block only, we have e lAk = Uk and e lBk = 14. All 
eigenvalues of Ak are in A k + 27tZ, so the only eigenvalue of Uk is e lXk . 
Similarly, the only eigenvalue of 14 is also e lXk , so Uk = e lXk I = 14- 
Since Uk = 14 holds for each k, we have arrived at U = V. 

1 =>- 2: Let us show that F A = E^t for all A G M. The desired 
conclusion follows from this claim. 

The inclusion F A C E e \ x is elementary and holds also in the case 
Ef x = 0. But 

dim(Ff) = ^dim(Ff) = N = ^ dim(F e e a), 

0<A<27r AgR 0<A<27t 

so we must have dim(F^) = dim(F^). This concludes the proof of 
the claim. □ 


Remark 6.4. The matrix exponential map is not a homomorphism (cf. 
the Baker-Campbell-Hausdorff formula), so finding which matrices 
have the same exponential cannot be simply reduced to finding the 
kernel of the exponential map. In the simple case where [A, B] = 0 it 
can be checked that e lA = e lB if and only if the spectrum of A — B is 
contained in 27 tZ, but neither direction of this result remains true if we 
assume no commutativity. 


6.2. Preparations for linearization. Let us fix an integer N > 1 as 
before. Let SPL and U denote the sets of skew-hermitean and unitary 
matrices of dimension N. We define the exponential map exp: SPL -A- 
U by exp(A) = e A = J2kLo h ^ k • F° r A E SPL, we define the adjoint 
map ad^: SPL — > SPL by ad^F = [A,B], We also define the adjoint 
map Ad{/: SPL -E SPL by Ad*r A = U AU~ l for U ElA. 


Let ip(z) =z 1 (1 — e z ) = J2 k =c 


(-W 


=0 (fc+l)! 

can also define the operator ^(ad^): SPL —> SPL 


z k . Using this power series we 


The solution of problem 6.2 given by proposition 6.3 is difficult to 


work with, so we linearize the problem. For this we will need the de¬ 
rivative of the exponential. The following two lemmas are well known, 
but we record them explicitly for the sake of an easy reference. The rest 
of this section is devoted to lemmas which we will need in section 16.31 


Lemma 6.5. The derivative of the exponential map is dexp A = e A Lp(a,&A )■ 
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Lemma 6.6. Ad exp (A) = e adA for any A G SFL. 

Lemma 6.7. For any A G we have SFL = keracU©imacU. 


Proof. We can assume A to be diagonal. We fix a basis so that we can 
write A in the block form 


(ii) 



A2 h 2 


\ 


\ 




/ 


where Ai < A 2 < • • • < A m are the distinct eigenvalues of — iA and Ik 
is the k x k identity matrix. 

The kernel ker ad^ consists of block diagonal matrices whose blocks 
are of size k\ x k\, k 2 x k 2 ,..., k m x k m (ie. with block structure cor¬ 
responding to that of A). For a complex k\ x k 2 matrix C, let us 
denote 


( 12 ) 


/° 

C* 


B c = i 


C 

0 


V 


\ 

0/ 


It now suffices to show that the matrix Be, for any C , is in the image 
of ad^; all other matrices with zero block diagonal (corresponding to 
the block structure of A) can be expressed as sums of matrices of this 
form. Now a simple calculation shows that [A, Be] = (Ai — A 2 )Bic- 
Therefore B c = [A, (Ai — A 2 ) _1 i?_ic] G imad^. □ 


Lemma 6.8. For A G SFL define Fa '■ SFL — > SFL by Fa — id — Ad e -A. 

We always have ker ad^ fl ini Fa = {0}. Furthermore, SFL = ker ad^ © im Fa 
if and only if no two eigenvalues of —iA differ by a number in 2 ttZ\{0}. 


Proof. Let us write A again in the block form (11). It is clear that ker ad^ 
consists of block diagonal matrices (with block structure corresponding 
to that of A). Let us see why such matrices are not contained in the 
image of Fa- 

Let Be denote the off-diagonal matrix of (12) as in lemma 6.7 


(Again we can without loss of generality only consider the first two 
eigenvalues; other pairs can be dealt with similarly.) Now a calculation 
shows that 


(13) FaB c — B {1 _ e ix 2 -ix 1)c . 

Hence im Fa contains only matrices with zero diagonal, and ker ad^ fl im Fa = 
{0} ' 

Let us then suppose that the condition on eigenvalues of —iA holds. 

Then 1 — e * A2_ * Al ^ 0, so Be = Fa( 1 — e* A2_ * Al ) _1 Hc G im Fa for all C. 

This implies that SFL = ker ad ,4 © im Fa ■ 
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Proof. Using lemma 6.5 we have 

dexp A (£> + C) = e A <p(acU)(i? + C). 


Now &(1aB = 0 so = B. By lemma 6.6 we have 

<p(acU)C = <p(acU) acU D 
= (1 -e~ adA )D 
D Ad eX p(—A)B 
= D - e~ A De A . 


□ 


Suppose then that SPL = ker ad 4 0 irn Fa ■ This implies that for 
each k\ x fc 2 matrix D there is a matrix C of the same dimensions 
so that FaBc = Bn. Due to the identity (13) this is requires that 
^ 1; so Ai - A 2 i 2vrZ. 

Lemma 6.9. We have 

dexp A (S + C) — e A B + [ e A , D\ 

for any A e SPL. B e kerad^ and C = ad aD e imadA- 


Combining these gives the claimed equation. 


□ 


Lemma 6.10. For any A G SPL the following are equivalent: 

(1) The derivative dexp A is a bijection. 

( 2 ) All matrices that commute with e A commute with A. 

(3) For any distinct eigenvalues iX and igi of A we have /r—A ^ 2i rZ. 


Proof. By lemma HI we have SP L = ker ad a © im ad a- Let 


B G keradA and C G SPL. By le mm a 6.9 e" A dexp A (B + slAaC) = 
B + C — e~ A Ce A . Thus by lemma 6.8 dexp^ is bijective if and only if 
condition [3] holds. 

@<=>! Let us change the basis so that A comes to the block form 

A4 \ 

, . A 2 4 2 

A = 1 


V 


XmPk, 


J 


where Ai < A 2 < • • • < A m are the eigenvalues of — %A and Ik is the k x k 
identity matrix. Then 


e 


A 


(e iXl I kl 

e iX2 h 2 


V 


\ 

e iXm h m ) 


Now it is clear that there are matrices that commute with e A but not A 
if and only if e lXi = e lXj for some i 7 ^ j. But this is equivalent with 
Ai - A j G 27tZ. □ 
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Lemma 6.11. For any A £ SFL the set 

Sa = {£ € M; d exp M is not bijective} 
is countable and discrete. 

Proof. Let cr C M be the set of eigenvalues of iA. Then by lemma 6.10 
Sa = {2irz/(\ — //); z £ Z \ {0}, A, fi £ cr, A ^ /a}. 

This set is clearly countable and discrete. □ 


6.3. Solution of the linearized problem. We are now ready to solve 
a linearized version of our problem. Physically linearization amounts 
to assuming the Hamiltonian to be very close to some known Hamil¬ 
tonian H 0 . We make the additional assumption that H 0 is a constant 
matrix, independent of the position x £ Si. We believe that the lin¬ 
earized problem can be solved in a similar way even for nonconstant Ho 
(at least generically if not for every Ho in every domain). 

Denoting by H the small deviation from Ho , we see from equa¬ 
tion (10) that the linearized problem is to recover H from the knowledge 
of 

r T 


d exp_j. 


iTH o 




up to a multiple of e lTH °. We will need a little geometrical lemma to 
solve this problem. 


Lemma 6.12. Let 12 C R n be a domain. Then for every r £ R. the set 
A r = {(x,y) £ <912 x <912; d(x,y) = r} C <912 x <912 is closed and has no 
interior points. 

Proof. Closedness follows from the continuity of the distance function 
d: <912 x <912 -A M. Since A r = 0 for r < 0 and Ho is the (closed) 
diagonal of <912 x <912, it remains to consider r > 0. 

Suppose (x, y) is an interior point of A r for some r > 0. Then there is 
6 > 0 so that d(z,w) = r for all z £ B(x, 8 )r I dfl and w £ B(y,5)r\ dfl. 
Fixing z = x shows that near y the boundary dSl is a sphere centered 
at x ; and similarly near re it is a sphere centered at y. Let L be 
the line containing x and y. Let LI be a line parallel to L with a 
distance less than 8 to it. Let z be the unique point in the intersection 
L' fi B(x, 5) fl dSl and similarly w £ L' fl B(y, 5) D dSl. Then clearly 
d(z,w ) < r, a contradiction. □ 

Theorem 6.13. Let 12 C ML, n > 2, be a bounded domain. Let 
H £ C(Sl,C NxN ) take values in hermitean matrices and let H 0 be a 
fixed hermitean matrix. Suppose that for every line 7 : [0,T] —» 12 with 
endpoints on <912 we know 

e iTH °dexp_ iTHll i T H(-,(t))dt 
Jo 
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up to an unknown multiple of the identity matrix. Then we can recover 
the trace free part of H(x) for every 16 O. 


Proof. Let us denote by A the space of trace free hermitean N x N ma¬ 
trices. We can then decompose the unknown matrix function uniquely 
as H{x) = A(x) + h(x)I where A(x) G A and h(x) G M. Both A and h 
are continuous functions. 


By lemmas 6.11 and 6.12 we know that dexp d / E y ^ Ho is injective for 
a dense set of pairs (x, y) G dfl x dSl. By continuity and density, we 
can recover the integral 


A(7(t))dt 


6.9 


e ,TH °dexp_- 


iTH 0 


for every line 7 : [0, T] — y 0. (Note that by lemma 
maps multiples of identity to multiples of identity?} By injectivity of 
the X-ray transform in Euclidean domains (see eg. na m ns d) , we 
can recover A everywhere in Q. □ 


There may be lines for which the corresponding derivative of the 
exponential is not injective. The set of singular lines has no interior 
points so we were able to use a density argument, but we suspect 
that this renders recovery somewhat unstable. A stability result for 
the linearized problem would give a local uniqueness result for the full 
problem. 


7. X-RAY TRANSFORMS WITH MATRIX WEIGHTS 

Let M be the closure of a compact, smooth domain in M n . Let 
SM = M x S' 11-1 denote the unit sphere bundle on M and L the 
manifold of maximal directed lines in M, which is the quotient of SM 
by the geodesic flow. Let ttm ■ SM —» M and 717 : SM —» L be the 
projection and the quotient map. 

Let W : SM —y C NxN be a continuous matrix function. We define 
the weighted X-ray transform Iwf '■ L —y C N of a continuous function 
/: M —>■ CA by 

Zwf(l)= [ W / (7( i ),7(^))/(7(^))dt 
Jo 

for any line 7: [0, T] —> M. 

We will first restrict our attention to the case n — 2 and later use 
results from this case for higher dimensions. 

For any p G M such that W(p,v) is invertible for all v G S n_1 there 
is a continuous function Q\ v : L —> C NxN so that ( ^lQw)W = I on 
the fiber S P M = {p} x S'” -1 . This will allow us to make the weight 
trivial at any single chosen point p G Min all directions. 
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Let I a denote the Riesz potential 


I a f(x) 


Ca 


X 2 \y \ 2 


f(x + y)dy. 


The size of the constant c a is irrelevant for us, but we remark that 
Ci = 1 / 2,71. Let P be the back projection operator defined on functions 
g: L -A by Pg: M —> C* 


Pg(x ) 


1 

2tt 



* L g(x,v)dv. 


We will invert X w for suitable weights W by freezing the weight at 
a point near the boundary and using the identity PTjf — hf and 
injectivity of the Riesz potential in suitable spaces. This gives a proof 
of injectivity in two dimensions in a sufficiently small domain (where 
the weight is close enough to the identity matrix), which in turn gives 
a more general injectivity results in dimension three or higher. (Injec¬ 
tivity in small domains was proved for the attenuated X-ray transform 
in HU with smooth scalar attenuations.) Our proof is based on the 
ideas of [27], but we improve it in two important ways. We allow 
rougher weights; in particular, we need not assume that the weight has 
even a first derivative. Most importantly — from the point of view of 
problem |5.1| at least — we allow the weight to be any invertible matrix. 
A similar problem for X-ray transforms with matrix weights has been 
studied before by several authors [36[ CEB] I2H 33 ESj- There is also 
a recent preprint [42j with very general results for matrix weights on 
manifolds. 

Injectivity of X w requires that the weight W is sufficiently regular. 
Our regularity assumption is quite relaxed, as we do not require dif¬ 
ferentiability. Assumptions on the weight are less stringent in dimen¬ 
sion three and higher because the problem is formally overdetermined. 
There are counterexamples to injectivity if the weight is not regular 
enough; see |23 Si- 

Lemma 7.1. Suppose 0 G M C M 2 . Take any /3 > 0 and consider a 
weight W which satisfies W G C l3 (M, Lip(S' 1 , C NxN )) and W(0,v) = 0 
for all v G S 1 . There are p G (1, oo) and £ > 0 so that for any 5 > 0 
we have for any f G L P (B( 0, h) D M, C N ) 


\\VPXwf\\ p < C P S S 11 W 11 qp ( M, Lip( s 1 ,c NxN )) ll/llp, 

where C p is a constant depending only on p. 

Proof. Let us denote by < inequalities including constants that only 
depend on the exponents. Let us assume that / is supported in B( 0, S). 
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Let i G {1, 2} and j = 3 — i. A calculation gives 


diPX w f{x) = —di / y-W(x + y, y/ \y\)f(x + y)dy 


2ir 


\y\ 


= / Ti[diW(x + y,y/\y\)f(x + y) 


2vr 


|y| 


W(x + y, y/ \y\)dif{x + y)]dy 


1 

27T 


p,W(a: + S/,J//|s/|) 

11/1 

■l)*rT 9 wW'(a; + 2/,j//|j/|) + j/)dj/. 


|y| 


|y|‘ 


(We may use a simple approximation argument to justify the above 
calculation with the assumed regularity of W and /.) Therefore, if 
K(x) = |^| sup„ eS i(|W(a;,u)| + \d v W{x,v)\), we have 


\VPI w f(x)\ 


< 


/ Ii(x + y)—\f(x + y)\dy. 

I b(—x,s) \y\ 


By Holder’s inequality, 

\vpx w m\ < ii*iU'< B( „,,„ (h~k i/i* wu* 

for any k G [1,2). (The proof given in [27J uses this estimate for k — 1 


only, which leads to a more restricting regularity assumption.) By the 
Hardy-Littlewood-Sobolev inequality 

II4<?IL - llyll,. 

where a G (0,2), q = 2s/(2 + as) and s G [l,oo). (The requirement 
q < 2/a is always satisfied.) 

Combining our estimates with s = p/k and a = 2 — k, we have 

. .... i/fc 

h-k 


\VPl w f(x)\\ Lr(R2) < ||A-|| 

L k '(B{0,6)) 
\\K\\ L k '(B( 0,5)) 


LP/ k (R 2 ) 


L/f 


1/fc 


L9(M 2 ) 

= 11^- lllA'(B(0,<5)) II / II L k v(B(0,5)) ■ 

Assuming kq < p (or equivalently 2 < (2+p)(2—k)) Holder’s inequality 
gives 

2+2p — kp — 2k 

II/IIzA9(.B(0,c5)) ~ kp II/IIlp(B(0,<5)) ■ 

If k < 2, the condition 2 < (2 + p)(2 — k) is satisfied for sufficiently 
large p, so that 

_ 2+2p— kp—2k 

(14) ||VPXvi//(x)|| iP ( K 2) < \\K\\L k '(B(0,S)) $ kp II/IIlp(B(0,<5)) ■ 

We therefore want to have K G L S (M) for some s > 2. 
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Since W £ C&(M, Lip (S' 1 , C NxN )) and W (0, v) = 0 for all v, we have 
K(x) < \x( 3 ^ 1 . If we choose s £ (2, 2/(1 — (3)), we get K £ L S (M). It 
remains to choose p so that 2 < (2 +p)(2 — s'). Inequality is now 
the desired estimate. □ 


We are now ready to give an answer to problem |5.1 
rem 


Using theo- 


5.2 


we can determine the trace free part of the Hamiltonian H 
from our quantum measurements, provided that H £ U 1,/3 for some 


(3 > 0. This was stated as theorem 2.3 


Theorem 7.2. Let M be the closure of a bounded convex domain in 
n > 3, and let (3 > 0. Let W £ C l3 (M,Lip(S n ~ 1 ,C NxN )) be invertible 
at every point on SM = Mx S'" --1 . (This regularity assumption is true, 
in particular, if W £ C 1, ^(SM,C NxN ).) If f £ C(M,C N ) satisfies 
T\yf = 0, then f = 0. 


Proof. Suppose / ^ 0 but Twf = 0. Take p in the intersection of spt(/) 
and the boundary of its convex hull so that there is a subtangent plane 
of the convex hull at p touching the support only at p. We shrink M 
to be this convex hull. Let v £ S'” -1 a vector pointing inward from p. 

For t > 0 denote x t = p + tv and suppose t is so small that x t £ 
int M. Let G be the set of two dimensional subspaces of the chosen 
tangent plane at p. Denote Mf = (x t + T) D M for t > 0 and T £ 
G. Let 5t be the maximum of diameters of Mf , T £ G. We will 
consider the weighted X-ray transform of / on each Mj, and for the 
sake of simplicity we will consider T £ G fixed and suppress it from 
the notation. To that end, let us denote the restrictions of / and W 
to M t by f t and W t , respectively, and the set of lines on M t by L t . 

Define Q t : S 1 —» C NxN by Q t (v) = W t {x t ,v)~ l . This is a Lips- 
chitz function, and we may define another Lipschitz function Q t : L t —X 
C NxN by letting Q t (£) = Qt(vf), where V£ £ S 1 is the direction of the 
directed line i. The Lipschitz constant of Q t is uniformly bounded in t. 

Since T Wt ft = 0, we have Z (7r , Q t ) W Jt = Qt%w t ft = 0. Let us denote 

W t = (n* Lt Q t )W t . The function n* Lt Qt- M t x S 1 -A C NxN is Lips¬ 
chitz (uniformly in t) and only depends on the direction, so we have 


Wt 


< 


CP(M,Lip(S 1 ,C NxN )) 


1 uniformly in t and W t (x t ,v) = I for all 


v £ S 1 . Using lemma 7.1 we have 


C\\f t \\ p <\\Vhf t \\ p 
= \\VPIift\\ p 

= pPl W J t + VPlj^yJt 
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for a sufficiently large fixed p and some constants C,c,e > 0. Note 
that this estimate is uniform in T G G. Since St — > 0 as t —» 0, there is 
t 0 > 0 is such that cSf < C/ 2 for all t < t 0 . For t < t 0 we get thus 

(15) F ll/dlp < \\VPIwJt\\ r - 

But ft = 0 so ft = 0 for t < t 0 . This means that / = 0 in 
Uo<t<t 0 Uteg which is a neighborhood of p. Due to our choice of p, 
this is impossible. Therefore Iwf = 0 is only possible if / = 0. □ 


It is not actually necessary that W ( x , v) is invertible for all (x, v ) G 
SM. The proof above only used invertible for v in a subspace home- 
omorphic to S n ~ 2 of S'" --1 , given x. If one wants to recover a function 
from its integrals over some set of lines L' C L. it can only be done 
stably if the normal bundle of L' covers SM (see eg. m sans]). 

we see that we have in fact also 


Inspecting the proof of theorem 7.2 


proven the following local result. A similar result on manifolds (with 
n > 3, N = 1 and W G C°°) was shown by Stefanov, Uhlmann and 
Vasy Corollary 3.2], As mentioned above, an earlier Euclidean 
version can be found in [27J (for C 2 scalar weights) and in (for C°° 
scalar weights using microlocal tools). 


Theorem 7.3. Let M be the closure of a bounded convex C 1 domain 
in W 1 , n > 3, and let > 0. Let W G C^(M, Lip(S’ n_1 , C NxN )) be a 
weight. Fixp G DM. Suppose p has a neighborhood U so that W (x, v ) is 
invertible for all x EU and v in a neighborhood of tangential directions 
at p. Suppose for f G C(U, C N ) we have Lwf(l) = 0 for all lines 7 
that stay in U. Then there is a possibly smaller neighborhood U' C U 
of p so that f\u' = 0. 


The proof also gives a stability estimate for the local problem; cf. (15). 
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